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Abstract. In this paper we obtain uniform positive lower bounds on the 
stable commutator length of elements in word-hyperbolic groups and certain 
groups acting on hyperbolic spaces (namely the mapping class group acting 
on the complex of curves, and an amalgamated free product acting on an as- 
sociated Bass-Serre tree). If G is a word- hyperbolic group that is (5-hyperbolic 
with respect to a symmetric generating set S, then there is a positive con- 
stant C depending only on 5 and on [5| such that every element of G either 
has a power which is conjugate to its inverse, or else the stable commutator 
length of the element is at least equal to C. By Bavard's theorem, these lower 
bounds on stable commutator length imply the existence of quasimorphisms 
with uniform control on the defects; however, we show how to construct such 
quasimorphisms directly. 

We also prove various separation theorems on families of elements in such 
groups, constructing homogeneous quasimorphisms (again with uniform es- 
timates) which are positive on some prescribed element while vanishing on 
some family of independent elements whose translation lengths are uniformly 
bounded. 

Finally, we prove that the first accumulation point for stable commutator 
length in a torsion-free word-hyperbolic group is contained between 1/12 and 
1/2. This gives a universal sense of what it means for a conjugacy class in 
a hyperbolic group to have a small stable commutator length, and can be 
thought of as a kind of "homological Margulis lemma" . 



1. Introduction 

Let G be a group, and let [G, G] denote the commutator subgroup. Given g e 
[G, G] the commutator length oi g, denoted c\{g), is the least number of commutators 
in G whose product is equal to g. The stable commutator length, denoted scl((7), 
is the limit of cl(g")/7i as n goes to infinity. If S [G, G] for some least positive 
integer n, define scl((7) = sc\{g"')/n, and define scl{g) = cx) if no power of g is 
contained in [G, G]; see §[2] for precise definitions. For a general introduction to the 
theory of stable commutator length, see . 

Informally, if G = T^iiX) for some topological space X, commutator length is 
the smallest genus of a surface in X whose boundary represents a given loop in 
X, and stable commutator length "rationally" measures the same quantity. At the 
homological level, stable commutator length is an filling norm with Q coefficients, 
in the sense of Gersten [TH] or Gromov pU] , 

This paper is concerned with obtaining uniform positive lower bounds on sta- 
ble commutator length in (word-) hyperbolic groups and groups acting on {S-) 
hyperbolic spaces. Morally, {S-) hyperbolic spaces are those whose geometry can 
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be efficiently probed by maps of triangles and otlier surfaces into the space, so it 
should not be surprising that they can be studied effectively with stable commu- 
tator length. However, there is a sense in which our results are counterintuitive, 
which we briefly explain. 

It is a widely observed fact that in the presence of (coarse) negative curvature, 
one can obtain upper bounds on the "size" of a surface which is efficient in some 
sense. Here size is measured with respect to some kind of norm; for instance area, 
Whitney b-norm, norm on homology etc. Efficiency might vary from context to 
context (e.g. harmonic, minimal, normal) and will depend on the way in which we 
measure the size of the surface. Our main results say that in the presence of (coarse) 
negative curvature, there is a uniform (positive) lower bound on the (homological) 
size of certain surfaces. 

We concentrate on worst-case behavior, rather than typical behavior, and our 
results are frequently sharp. Other authors have studied commutator length and 
its relation with negative curvature, especially Gromov (221 , §6.6*2. In our language, 
Gromov observes that in a word-hyperbolic group, if g G [G, G] is not torsion and 
does not have a power conjugate to its inverse, then scl((7) > (actually, Gromov 
neglects to mention the second possibility) . The key innovation in our paper is that 
our bounds are uniform, and depend only on macroscopic features of the group, 
specifically S and number of generators. Note that any hyperbolic group may be 
made 5-hyperbolic for some universally small d (say d < 10) just by increasing the 
number of generators, so our results in this sense are best possible. 

Perhaps our most striking theorem is Theorem B, a kind of "spectral gap" the- 
orem, which says that in a non-elementary torsion-free word-hyperbolic group, the 
first accumulation point for stable commutator length as a function from conjugacy 
classes to positive real numbers is between and ^. These bounds imply that 
there is a universal sense of what it means for an element in a word-hyperbolic 
group to have a small stable commutator length. This should perhaps be compared 
with Margulis' Lemma, which says that there is a universal sense of what it means 
for a closed geodesic in a hyperbolic manifold to be short. The difficulty in trans- 
porting Margulis' Lemma to geometric group theory is that geometric notions in 
a group are typically only defined up to a certain ambiguity (e.g. quasi-isometry) 
which obscures small scale geometric phenomena. The advantage of working with 
stable commutator length, and the power of our theorem, is precisely that it cap- 
tures such small scale phenomena. This comparison is more than superficial — the 
key to obtaining the sharp lower bound is to use estimates due to Mineyev which 
reproduce, in a general 5-hyperbolic space, geometric phenomena which are strictly 
analogous to G^ phenomena in Lie groups. 

We are able to give similar uniform lower bounds on the stable commutator 
length of certain elements in two other important classes of groups: mapping class 
groups of surfaces (hereafter denoted MCG(S')), and amalgamated free products. 
In general, neither kind of group is (5-hyperbolic, but each acts naturally on a cer- 
tain (5-hyperbolic space: the complex of curves, and the Bass-Serre tree respectively. 
Obtaining lower bounds on stable commutator length in mapping class groups is 
intimately tied to important problems in 4-dimensional symplectic geometry; for ex- 
ample, estimating the complexity of a symplectic 4-manifold filling a given contact 
3-manifold, or controlling the ratios of characteristic numbers (e.g. Euler charac- 
teristic and signature). This point of view has been pioneered by D. Kotschick, 
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sometimes in collaboration with H. Endo, in a series of papers which include [25| . 
[12j . [26j and |13j . From another point of view, there are relations between symplec- 
tic geometry and stable commutator length which are more intimately connected 
with invariants like Hofer length and subgroup distortion; see work of Polterovich, 
especially [33] , 

The Bavard Duality Theorem (see Theorem 12.51) gives a duality between stable 
commutator length, and certain kinds of functions on a group, called homogeneous 
quasimorphisms. A function / : G — *■ K is a homogeneous quasimorphism if it is 
homogeneous (i.e. if it satisfies /(g") = nf{g) for all 3 £ G and rt G Z) and if there 
is a least real number D{f) called the defect, for which \ f{gh) — f{g) — f{h)\ < D{f) 
for all (7, /i G G. The Duality Theorem says that obtaining lower bounds for stable 
commutator length is equivalent to constructing homogeneous quasimorphisms with 
small defects (see §[2]). Bavard's theorem is non-constructive, and uses the Axiom 
of Choice (in the form of the Hahn-Banach Theorem); however in our paper we 
are able to construct explicit quasimorphisms with small defects directly. We are 
also able to prove various separation theorems, constructing quasimorphisms with 
small defects that take big values on prescribed elements and vanish on others. 
This is a quantitative improvement on the kinds of separation theorems proved or 
conjectured by various people in the past, and we expect it to have a number of 
applications. 

Let us stress that this paper is concerned more with developing foundations, and 
understanding what (to us) seems like a fundamental algebraic/geometric inequal- 
ity, manifested in several important group theoretic contexts, than with deducing 
topological or other corollaries. 

1.1. Statement of results. We now give a brief summary of the contents of the 
paper. In §2 we give definitions, standardize notation, and recall some basic ele- 
ments of the theory of (5-hyperbolic groups and spaces, and bounded cohomology. 
We spend some time discussing Mineyev's geodesic flow space, which is a techni- 
cal tool that we find useful for establishing uniform bounds on stable commutator 
length directly by means of geometry rather than using quasimorphisms (see [j2]for 
definitions). 

There are other versions of the geodesic fiow space of a word-hyperbolic group or 
space, and that of Gromov [21] is probably best known. However, Mineyev's space 
has some features that make it technically easier to use for our purposes. 

In §3 we give the first version of our first main result, the Gap Theorem: 

Theorem A (Gap Theorem, weak version). Let G be a word-hyperbolic group that 
is 5-hyperbolic with respect to a symmetric generating set S with \S\ generators. 
Then there is a constant C{S,\S\) > such that for every a ^ G, either scl(a) > C 
or else there is some positive integer n and some b d G such that ba^"^b^^ = a". 

Note that if ba-'^b-^ = a" then scl(a) = 0. 

It is easy to produce examples of word-hyperbolic groups which contain elements 
with arbitrarily small (positive) stable commutator length, so the dependence on S 
and 1 5 1 is necessary. 

In §4 and §5 we introduce so-called counting quasimorphisms, and use them to 
prove the second (stronger) version of the Gap Theorem. In the following theorem, 
we use the notation r(a) to denote the translation length of an element a G G, as 
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measured in the Cayley graph TsiG) of G with respect to a fixed generating set S. 
It is worth keeping in mind that if a is not torsion, there is a positive lower bound 
on r(a) that depends only on S and \S\. 

Theorem A' (Gap Theorem, strong version). Let G be a word-hyperbolic group 
that is S -hyperbolic with respect to a symmetric generating set S with \S\ generators. 
Let a Cz G be a (non-torsion) element. Assume there is no n > and no b G with 
ba^^''b^^ = a". Then there is a homogeneous quasimorphism h on G such that 

(1) h{a) = 1 

(2) The defect of h is < C{S, \S\). 

Moreover, let ai <E G be a collection of elements for which T — supj T{ai) is finite. 
Suppose that for all integers n,m ^ and all elements 6 e G and indices i, there 
is an inequality 

ba'^b-^ aT 

Then there is a homogeneous quasimorphism h on G such that 

(1) h{a) — 1, and h[ai) = for all i 

(2) The defect of his < G'{5, |S'|,T) 

The weak version of the Gap Theorem follows from the strong version. Moreover, 
the first part of the strong version follows from the weak version together with an 
application of Bavard's theorem, but the implication in this direction uses the 
Axiom of Choice. Note that the constants C appearing in the two versions are 
different in either case, but related. 

The second part of this strong version of the Gap Theorem can be thought of 
as a kind of separation theorem, which can be summarized in words as follows: 
given an element a and a finite set of elements Oi ^ G whose nontrivial powers are 
never equal to a conjugate of a nontrivial power of a, one can (explicitly) construct 
a homogeneous quasimorphism that vanishes on all the and is positive on a; 
moreover, the defect of this quasimorphism can be uniformly controlled just in 
terms of the translation lengths of the at, and in terms of S and \S\. 

Such separation theorems have been pursued by several authors. For exam- 
ple, Polterovich and Rudnick |34| proved such a separation theorem for the group 
SL(2,Z), and asked if one can generalize it to hyperbolic groups; our work gives a 
quantitative positive answer to their question. 

In §6 we state and prove the Accumulation Theorem: 

Theorem B (Accumulation Theorem). Let G be a torsion free non- elementary 
word-hyperbolic group. Then the first accumulation point Soo for stable commutator 
length satisfies 

— <Soo <- 

12 - 2 

Here Soo is the first accumulation point for the values (with multiplicity) of the 
function scl on the set of conjugacy classes in G, thought of as an ordered subset 
of R. As mentioned above, we think of this theorem as a kind of "homological 
Margulis Lemma" . One can obtain similar theorems for groups with torsion, but 
the statement is not so clean or universal. 

In §7 and §8 we discuss two important examples of groups acting on (5- hyperbolic 
spaces: the action of the mapping class group on the complex of curves, and the 
action of an amalgamated free product on its associated tree. 
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Our main theorem about the mapping class group is: 

Theorem C (Mapping Class Theorem). Let S be a compact orientable surface 
of hyperbolic type and MCG(S') its mapping class group. Then there is a positive 
integer P depending on S such that for any pseudo-Anosov element a, either there 
is an < n < P and an element b G MCG(5) with ba^"b~^ = a", or else there 
exists a homogeneous quasimorphism h on MCG(5) such that h{a) = 1 and the 
defect of h is < C{S), where C{S) depends only on S. 

Moreover, let ai G MCG(S') be a collection of elements for which T ~ supj r(ai) 
is finite. Suppose that for all integers n,m ^ and all elements b G MCG(S') and 
indices i, there is an inequality 

ba'^b-^ ^ 

Then there is a homogeneous quasimorphism h on MCG(S') such that 

(1) h(a) = 1, and h{ai) = for all i 

(2) The defect of h is < C'{S,T) 

Note that if b is not pseudo-Anosov, then r(6) = 0. Moreover, for any surface S, 
there are infinitely many conjugacy classes of elements b with t(6) < 0{l/glog{g)) 
where g = genus(S') (see [15], especially Theorem 1.5). 

In the case of the mapping class group, one knows that not all the quasimor- 
phisms are of the kind promised by this theorem. Along completely different lines, 
Endo and Kotschick 12J managed to obtain lower bounds on stable commutator 
length for elements a £ S which are products of positive Dehn twists in disjoint 
curves. Such elements are reducible, and therefore fix a simplex in the complex of 
curves C(S'); by contrast, our methods do not give any information about elements 
with fixed points. 

Note that our arguments make use of a key acylindricity property proved by 
Bowditch, which is something like a bicombing in the context of C(S'). Our lower 
bounds do depend on the surface S. There is some evidence that this dependence 
is necessary: a forthcoming paper by Kotschick [27] shows that stable commutator 
length goes to zero under stabilizing genus by adding handles. 

Note that as in Theorem A', Theorem C includes a separation theorem. Endo and 
Kotschick gave examples that set some limits on how far such a theorem might be 
generalized. Firstly, they show that there are elements a G MCG(5') which are not 
pseudo-Anosov, but nevertheless for which there are homogeneous quasimorphisms 
which do not vanish on a. Secondly and conversely, they show that there are 
elements b G MCG(S') of infinite order, for which no power of b is conjugate to its 
inverse, but for which h{b) ~ for any homogeneous quasimorphism h on MCG(5'). 
These examples indicate the subtlety of the function scl on MCG(S'), and suggest 
that a multiplicity of approaches are necessary to appreciate its full richness. 

See [12] and [13] respectively for details. 

Our main theorem about amalgamated free products is: 

Theorem D (Amalgamation Theorem). Let G — A *c B. Let w be a word on 
A, B which is reduced and cyclically reduced such that \w\ > 1, and let w denote the 
corresponding element in G. Suppose that the double coset CwC, does not contain 
the element corresponding to any cyclic conjugate of w^^. Then there exists a 
homogeneous quasimorphism h on G such that hiw) = 1, and the defect of h is 
< 312. 
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One can make a similar statement about HNN extensions. 

Here a word w is said to be cyclically reduced if w is not equal to viV2Vi^ {vi ^ 0) 
as a word. The double coset condition is at first glance somewhat odd. We examine 
a particular example (of essential slopes in knot complements in 3-manifolds) in 
some detail and show why it is natural and unavoidable. 

1.2. Acknowledgments. We would like to thank Nathan Dunfield, Benson Farb, 
Etienne Ghys, Daniel Groves, Ursula Hamenstadt, Dieter Kotschick, Jason Man- 
ning, Shigenori Matsumoto, Igor Mineyev, Shigeyuki Morita, Leonid Polterovich 
and Bill Thurston. The second author appreciates the hospitality of the depart- 
ment of Mathematics at Caltech, where he visited the first author, and MPI in 
Bonn. 
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2. Background Material 

2.1. (5-hyperbolic groups and spaces. We assume the reader is familiar with 
basic elements of the theory of (5-hyperbolic groups and spaces: quasi-isometries, 
quasigeodesics, word-hyperbolicity, (5-thin triangles, translation length, etc. For a 
reference, see [?!] . 

Let G be a 5-hyperbolic group with a fixed generating set S. We denote the 
translation length of an element g e G on the Cayley graph r5(G) by T{g). This 
length is defined by 

rig) . hm ^^^^^ 

n — >OG fi 

where distance is measured in the usual way in r5(G). Observe that t is a class 
function. 

If g is torsion, T{g) — 0. Otherwise, recall that there is a constant A^(|S'|, S) and 
an integer n < N such that a power g" of g leaves invariant a bi-infinite geodesic axis 
and acts on this axis as translation through a distance nT{g). Since this distance is 
necessarily a positive integer, this implies that there is a positive constant B{\S\^ 5) 
such that T{g) > B for every non-torsion element g. See e.g. Theorem 5.1 of 14 . 

2.2. Stable commutator length. 

Definition 2.1. Let G be a group, and a e G. The commutator length of a, 
denoted cl(a) , is the minimum number of commutators whose product is equal to 
a. If a is not in [G, G], we set cl(a) = oo. The stable commutator length, denoted 
scl(a), is the lim inf 

scl(a) = hmuit 

n^oo n 

Notice scl(a) = oo if and only if the order of [a] in Hi{G;Z) is infinite. 

The functions cl and scl, like r, are class functions. Observe that cl and scl are 
monotone under homomorphisms. That is, if (f> : G H is a homomorphism, then 

scl((/)(a)) < scl(a) 



for all a G G, and similarly for cl. 
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Example 2.2 (Mirror Condition). Let G be a group, and suppose there are elements 
a,h E G and integers m such that ba^b^^ = a™. Then 

n ■ scl(a) ^ scl(a") ^ scl(6a"6"i) = sc^a™) = m • scl(a) 

and therefore scl(a) = 0. (Note that the defining property of a means that a"~™ G 
[G, G], so scl(a) ^ oo). 

Such elements can be found in Baumslag-Solitar groups. We compute 

|n| ■ r(a) = r(a") = T{ba"b-^) ^ T(a") |m| ■ r(a) 

so if T(a) is not zero (which can only happen for a torsion element when G is 
hyperbolic), this is only possible if n = ±m. The group 

(a, 6 I ba-H-^ = a,b^ = id) 

may be thought of as the (orbifold) fundamental group of the interval with mirrored 
endpoints, which is sometimes just called a mirror interval. In a hyperbolic man- 
ifold, an element of tti conjugate to its inverse is represented by a geodesic which 
"double covers" a geodesic segment with both endpoints on an orbifold stratum of 
order 2 (i.e. it double covers a mirror interval). Thus we will sometimes say that 
an element a G G for which there is no & G G and no integer n ^ for which 
ba'^b~^ = a~" satisfies the no mirror condition. 

Analogues of this condition will occur in the hypotheses of all our main theorems 
throughout this paper. 

Example 2.3. Let X be a knot complement in with genus g, and let Mp^q be 
the result of {p,q) Dehn filling on K. Then if a G Tri{AIp^q) represents the core of 
the added solid torus, scl(a) < g/p (in fact, by taking covers and tubing boundary 
components together, one can actually obtain scl(a) < (<?— l/2)/p; see |5]). If K is 
not a satellite knot or a torus knot, for all but finitely many pairs (p, q) the resulting 
manifold Mp q is hyperbolic, and its fundamental group is word- hyperbolic. (See 
e.g. 1251). 

2.3. Quasimorphisms. 

Definition 2.4. Let G be a group. A quasimorphism is a fimction 

: G^ M 

for which there is a least constant D{(l)) > called the defect, such that 

|0(a) + (/)(6)-(/)(a6)| <i?(0) 

for all a,b G G. In words, a quasimorphism fails to be linear by a bounded amount. 

A quasimorphism is homogeneous if 4'{a") = n(j)(a) for all integers n and all 
a G G. 

If is a quasimorphism on G, then one can obtain a homogeneous quasimorphism 
(p by the formula 

4>{a) = lim — — ^. 

n — ^oo Ti 

Note that the defining property of a quasimorphism (that it is "almost linear") 
implies that the limit exists. A homogeneous quasimorphism is a class function. 
The defect D(^) is related to D{(l)) by 

D($) < 2 • Z?(0) 
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see e.g. |6l, Corollary 2.59. Homogeneous quasimorphisms are often easier to work 
with than ordinary quasimorphisms, but ordinary quasimorphisms are easier to 
construct. We use this averaging procedure to move back and forth between the 
two concepts. 

We denote the vector space of all homogeneous quasimorphisms on G by Q{G). 
Observe that for any commutator [a, 6] G G and any (j) G Q{G) we have 

ma,b])\<D{<P) 

It turns out that there is an equality 

sup|0([a,6])| =i^(0) 

a, 6 

see [I], Lemma 3.6. 

Quasimorphisms and stable commutator length are related by Bavard's Duality 
Theorem (c.f. [1]): 

Theorem 2.5 (Bavard's Duality Theorem). Let G be a group. Then for any 
a e [G, G], we have an equality 

scl(a) = - sup — — 
2 0eQ(G) -0(0) 

Note that one must take the supremum over (j) G Q{G) with 0(a) ^ (and 
therefore I?(0) > 0) for this to make sense; also, if Q{G) = H^{G) then scl(a) = 
for a G [G, G] . Note further that the theorem makes sense and is true for a satisfying 
a" G [G, G] for some positive n. 

Bavard's theorem depends on the Hahn-Banach theorem and — L°° duality. 
Note that the Hahn-Banach theorem is equivalent to the Axiom of Choice. In 
particular, the quasimorphisms promised by Bavard's theorem are typically not 
constructible. Therefore we take explicit note in the sequel of when our arguments 
make use of Bavard's theorem, and when they do not. 

Example 2.6. For non-clcmcntary hyperbolic groups G, the space Q(G) has an un- 
countable dimension ([14J). But for certain groups, one can completely understand 
Q{G). For example, let G denote the universal central extension of the group of 
all orientation-preserving homeomorphisms of S^. Then Q{G) — K, generated by 
Poincare's rotation number. In particular, every non-negative real number is equal 
to scl of some conjugacy class in this group. Similarly, let T denote the univer- 
sal central extension of Thompson's group T of dyadic piecewise linear homeomor- 
phisms of . Then T is a finitely presented group which realizes every non- negative 
rational number as scl. See [19j for more about the group T and its bounded coho- 
mology. 

2.4. Mineyev's geodesic flow space. To understand stable commutator length 
in word-hyperbolic groups, one needs to control the geometry of maps of surfaces 
into 5-hyperbolic spaces. Naively, following the usual practice in hyperbolic mani- 
folds, one triangulates a surface and "straightens" the simplices, and then appeals 
to the Gauss-Bonnet theorem to control area and therefore diameter in terms of 
injectivity radius. In a (5-hyperbolic space, the straightening must be done in a 
careful way. There are at least three technical approaches to this straightening: 

(1) Gromov's geodesic flow space ([21], chapter 8) 

(2) Mineyev-Monod-Shalom's homological ideal bicombing ([30]) 



STABLE COMMUTATOR LENGTH IN WORD-HYPERBOLIC GROUPS 9 

(3) Mineyev's geodesic flow space ([29]) 
The approach in 21j is not entirely fleshed out, and the "geodesic flow" is really 
a quasigeodesic flow; therefore for our applications, Mineyev's flow space is best 
suited. 

Mineyev constructs from a hyperbolic complex X (for instance, a Cayley graph 
for a word-hyperbolic group) a geodesic flow space 3^(X), which consists of a union 
of parameterized lines joining ordered pairs of distinct points in the ideal boundary 
dX. The space ^^{X) admits a number of metrics and pseudo-metrics, of which the 
pseudo-metric is most important to us. 

The following summarizes some of the main properties of '3^{X) which we use. 

Theorem 2.7 (Mineyev). Let X,dx be a d-hyperbolic complex with valence < n. 
Then there exists a pseudo-metric space 3^(X),d^ called the flow space of X with 
the following properties: 

(1) 3^{X) is homeomorphic to {dX x dX — A) x M. The factors (p, q, •) under 
this homeomorphism are called the flowlines. 

(2) There is an W-action on 3^{X) (the geodesic flowj which acts as an isometric 
translation on each flowline {p, q, ■). 

(3) There is a Z/2Z action x ^ x* which anti-commutes with the R action, 
which satisfies d^{x,x*) — 0, and which interchanges the flowlines {p,q, •) 
and {q,p,-) 

(4) There is a natural action of Isom(X) on 3^{X) by isometrics. If g G 
Isom(X) is hyperbolic with fixed points p^ in dX then g fixes the flow- 
line {p^ ,p'^, •) of3^{X) and acts on it as a translation by a distance which 
we denote T(g). This action of Isom(X) commutes with the R and Z/2Z 
actions. 

(5) There are constants M > and < A < 1 such that for all triples 
a,b,c dX , there is a natural isometric parameterization of the flowlines 
(a, c, •), (6, c, •) for which there is exponential convergence 

d''iia,c,t), ib,c,t)) < MA* 

(6) // X admits a cocompact isometric action, and G C Isom(X) is torsion 
free, then there is a G-equivariant {K,e) quasi-isometry between 3^(X),d^ 
and X, dx • 

Moreover, all constants as above depend only on 6 and n. 

Note that although f{X) is homeomorphic to {dX x dX — A) x M, this topology 
is not induced by the pseudo-metric d^, since this pseudo- metric fails to separate 
pairs of points interchanged by the Z/2Z action. 

This is a conflation of several results in [29]. The pseudo-metric d'* is defined 
in Section 3.2 and Section 8.6 on a slightly larger space which Mineyev calls the 
symmetric join. The flow space, defined in Section 13, is a natural subset of this. 
The basic properties of the K,Z/2Z and Isom(X) action are proved in Section 2. 
The remaining properties are subsets of Theorem 44 (page 459) and Theorem 57 
(page 468). There is another natural metric on ^{X) which induces the topology 
on S'(X), and for which the R action is by bi-Lipschitz homeomorphisms. 

For our applications, the key points are that the action of hyperbolic isometrics 
on the fiowlines is by translations, and the exponential convergence of flowlines 
with common endpoints at infinity. 
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Remark 2.8. Since the flowlines (p, q, •) and •) are distance apart in , we 
may think of them as different parameterizations of the same hnes in a suitable 
quotient on which is a metric (and not just a pseudo- metric). 

Remark 2.9. In bullet (5), the synchronous parameterizations of (a, c, t) and (&, c, t) 
for which exponential convergence holds are precisely those for which (a, c, 0) is the 
point on the flowline (a,c, •) closest to b, and (6, c, 0) is the point on the flowline 
(fo, c, •) closest to a (interpreted in terms of suitable horofunctions). In particular, 
for every triple a, b, c of distinct points in 3^(X), if A is a triangle obtained from the 
union of three flowlines joining these points in pairs, the edges of A are exponentially 
close to each other away from a compact subset of A of uniformly bounded diameter 
(i.e. the diameter of the "thick part" of A is bounded independently of the choice 
of a, 5, c). 

3. The Gap Theorem, first version 

The Gap Theorem, to be proved below in its first version, says that in a word- 
hyperbolic group G, if a is an element which satisfies the no mirror condition 
(from Example 12. 2p and has a sufficiently long translation length, then the stable 
commutator length of a can be uniformly bounded from below. Example 12.21 and 
Example 12.31 together point to the necessity of both hypotheses. 

Theorem A (Gap Theorem, weak version). Let G be a word-hyperbolic group that 
is 6 -hyperbolic with respect to a symmetric generating set S with \S\ generators. 
Then there is a constant C{S, \S\) > such that for every a d G, either scl(a) > G 
or else there is some positive integer n and some b £ G such that ba~"b~^ = a". 

Remark 3.1. By Bavard's theorem, the uniform lower bound on scl(a) is equivalent 
to the existence of a homogeneous quasimorphism (f> with 0(a) = 1 for which there 
is a uniform upper bound on the defect D{(f)) depending only on S and \S\. 

Remark 3.2. If 6a""6^^ = a" for some b and positive integer n, then b^ and a" 
commute, and since G is hyperbolic, suitable powers of a and b generate an infinite 
dihedral group. 

It follows that if G is torsion free, no such element b can exist. Conversely, note 
that every torsion element a satisfies a" = a~" = id for some positive n. It follows 
that the no mirrors condition is vacuously satisfied in a torsion-free hyperbolic 
group. 

The following proof uses properties of Mineyev's geodesic flow space. We prove a 
stronger theorem in SjSl by directly constructing sufficiently many quasimorphisms. 
The second proof is logically superior to the first, since the construction is direct, 
and moreover the existence of these quasimorphisms does not depend on Bavard's 
Theorem and the Axiom of Choice. However, the first proof is more "geometric". 

Proof. Let Ts{G) denote the Cayley graph of G with respect to the generating set 
5". For any element a € G recall that r(a) denotes the translation length of a. As 
observed in § 12. li every element a e G is either torsion, or has a power a" where 
n ^ G2(|S'|,(5) that fixes an axis la, and therefore has r(a) > Gi(|S'|,i5). For the 
remainder of the proof we replace a by a suitable power a", and assume that a fixes 
an axis la. 
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For convenience, we let X be a 2-coinplex coming from a finite presentation for 
G with generating set S. Observe that Vs{G) is the 1-skeleton of the universal 
cover K . Suppose scl(a) is very small, so that there is an expression 

m 
i=l 

for some 6i,Ci S G such that m/n is small. Let S be a surface of genus to, and 
fj : Y, —> K a, simplicial map which takes the boundary to the immersed circle 7 = 
la/a"^- We would like to choose a representative surface whose area and geometry 
can be controlled by x(^) length(7). Since we need to control constants, this 
must be done carefully. 

We appeal to Mineyev's Theorem 12.71 Using this theorem, one proceeds as 
follows. One picks a 1-vertex triangulation of S, and spins the vertex along 
thereby producing an ideal triangulation whose edges can be realized as a subset in 
the quotient space 3^{Ts{G))/G. This is covered by a 7ri(E) equivariant map from 
the 1-skeleton of S to 'J{Vs{G)), where 7ri(I]) acts on JiVsiG)) by its image in G. 
Each ideal triangle in S corresponds to a union of three flowlines in 3^{Ts{G)). 

In as pointed out in Remark 12.91 each triangle consists of a thick core of 
diameter at most C3 (depending only on 5 and \S\) together with thin regions 
consisting of pairs of geodesic rays whose distance converges exponentially fast to 
zero. The image of S consists of 2|x(S)| — Am — 2 triangles. Because of the 
exponential convergence of geodesies in J', we can give S a hyperbolic metric of 
constant curvature ~k in such a way that the triangles are totally geodesic, and 
the map to S'{Ts{G)) is distance decreasing on the 1-skeleton, where k > depends 
only on S and \S\. It follows from the Gauss-Bonnet theorem that away from a 
thick part consisting of at most (4to — 2) regions whose diameters sum to at most 
(4to — 2)C3 for some constant C3, the thickness of E is bounded by a constant C4, 
where C3 and C4 depend only on \S\ and 6. In fact, by choosing C3 sufficiently 
large, we may assume C4 is as small as we like, a fact which we will not use. 

After composing with a quasi-isometry 

(j) : JiX) k 

and filling in the map on triangles, we get an induced map 

5i : E ^ X 

It doesn't really matter how the map is filled in on triangles, since we are ultimately 
only interested in the distances between points contained in the image of 9S. Again, 
by Theorem 1 2. 71 we can assume that the quasi-isometry constants of cj) depend only 
on 5 and \S\. 

Under this quasi-isometry, the constants C3 and C4 for the diameter of the 
core and the thickness of the complementary region must be replaced by analogous 
constants C3 and C4 which still depend only on 5 and \S\. The image of every 
boundary component is a quasigeodesic which is within a bounded distance of some 
translate of the axis la] it follows that for a suitable choice of cf), without changing 
the constants involved, we can assume that ai takes (9S in a 7ri(I])-equivariant way 
to a union of translates of la- 

By abuse of notation, we denote the image of this map as Si. Let a be a 
fundamental domain for the action of a" on la- By our estimates, away from a 
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subset of a of length at most {Am — 2)6*3, every point p €z a can be joined by an 
arc f3p in the image of Ei, such that the endpoints of /3p are a distance at most C4 
apart in K, and he on distinct components la and of ai(9Si). PuUing back by 
cTi and projecting to Si, we obtain a homotopically essential proper arc /9p € Ei. If 
Pi,P2 are at least distance 2C4 apart in a, we see that the pulled-back arcs /3pj, 
can be isotoped to be disjoint in Si. A surface of genus m with one boundary 
component contains at most 6m — 3 disjoint nonparallel proper essential arcs, and 
each arc has 2 endpoints. It follows that there are at most 12to — 6 components Z* 
which are joined by arcs Pp with nearby endpoints to points in a. 

For a fixed component Z* , the set of points where is close to la is (coarsely) 
connected, by convexity of quasigeodesics in J-hyperbolic spaces. Consequently 
there is a connected arc a' C a satisfying 

length(.') > length(.)-(4m-2)(C^+CD 

12?7i — 6 

which cobounds a strip R' of Si of width < C4 with some fixed Z^. 

By convexity of quasigeodesics, there is a connected subinterval a <Z a' satisfying 

length(cr) > length(cr') - 2C'^ 

such that a cobounds a strip i? of Si of width C5 depending only on S with some 
fixed Z^. The strip R is much longer than it is wide, and it makes sense to say that 
a choice of orientations on the la and Z^ agree or disagree (see Fig. [1]) : 



Figure 1. The orientations on la and Z^ might agree or disagree 
along R 

Now, since Si is oriented, the induced orientations on la and Z^ disagree. There- 
fore Z^ is a nontrivial translate of the axis la- That is, there is some c with Z^ = c{la)- 
Then e = ca~^c~^ stabilizes Z^, and moves points in (roughly) the same direction 
as a. 

Let p be the midpoint of a. Suppose that length(7) — r(a) is big compared 
to the constant C5 (e.g. length(7) > 2C5 will do). We consider the translates 
e-'"a'^(p) for 

I I ^ lcngth((T) ^ " _|_ 
~ 3 ■ length(7) ~ 36m 
where the last inequality follows from our assumptions about length(7). 
Then by our estimate on the width of the strip R, we have 

d{p,e-^a^{p))<AC5 

There are at most |S'|'*'^'* elements in the group which move the point p a distance 
< 4C5, and therefore if 

+ 1 > \S\^^^ 

then by the pigeonhole principle there are wi 7^ W2 such that 
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and therefore 



In words: if scl(a) is too small, a nontrivial power of a is conjugate to its inverse. 

It follows contrapositively that when r(a) > 2C5 and no power of a is conjugate 
to its inverse, we have scl(a) > j^|S'|~'"^^. If a is not torsion, T(a) > Ci. So for 
any a with no power conjugate to its inverse, we have an estimate 



scl(a2'^=/^^) > ^ISr'^^ 
36 



And therefore the constant 

satisfies the claim of the theorem. □ 

Remark 3.3. It follows from the proof that if a does not satisfy the no mirror 
condition, so that there is a positive n with a" = ca~"c~^ for some c, then n may 
be chosen to be less than some constant depending only on S and \S\. 

It is interesting to ask the following: 

Question 3.4. Let G be a group, and let H be the set of elements in G which 
satisfy the no mirror condition; i.e. for which a" ^ ba^'^b^^ whenever 6 G G and 
0. Under what circumstances is infag/f scl(a) positive? 

We say that a group G has the stable commutator gap property if it has the 
property described in Question l3.4l Theorem A shows that hyperbolic groups have 
the stable commutator gap property. 

Example 3.5. In [7], Comerford and Edmunds show that in a free group F, every 
nonzero element a satisfies scl(a) > 1/2 (this lower bound essentially follows from 
an earlier result of Duncan and Howie |10| and improves an earlier estimate scl(a) > 
1/6 obtained by Culler in [8]). Let G be residually free; i.e. suppose for all nonzero 
a G G there is a homomorphism : G ^ F to a free group, for which ipaia) is 
nonzero. Since scl is nonincreasing under homomorphisms, G satisfies the stable 
commutator gap property. 

4. Counting functions 

Here we review so-called counting functions, which generalize a construction 
introduced by Brooks [4] to construct quasimorphisms on free groups. See |14j . |16j 
or [2] for more details. 

Suppose G is a group with a fixed symmetric generating set S, and F := TsiG) 
is its Cay ley graph. Let w be a word in the generating set. Let a be a (directed) 
path in F, and \a\ its length. Define \a\^ to be the maximal number of times that 
w can be seen as an (oriented) subword of a without overlapping. 

Example 4.1. \xyxyx\xy = 2. \xyxyx\xyx = 1. \xxyxy\yx — 1. 

Let < 14^ < |w| be a constant. For x,y eT, define 

Cw.wix,y) := d{x,y) - M{\a\ - W\a\w), 

a 

where a ranges over all the paths from x to y. If the infimum is attained by a, we 
say a is a realizing path for Cw,w from x to y. If 7 is a geodesic from x to y, then 
define c^,, 11/(7) = Cyj,w{.x,y). 
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Fix a point x G F. Define for a ^ G 

Cww{a) ■■= Cw^wix^ax). 

Cw,w is called the counting function for the pair (w, W). Let w^^ denote the inverse 
word of w. We define 

In [13], the normalization VF = 1 is used. This is an appropriate choice of con- 
stant when w* :— ■ ■ ■ wwww ■ ■ ■ is a bi-infinite geodesic, which is the case in our 
applications throughout this paper. 

More generally, when G acts on a graph F (not necessarily properly) , we modify 
the definition of counting functions as follows. Let w be a path in F and call aw 
for a e G a copy of w or (interchangeably) a translate of w. For a path a in F, 
define \a\w to be the maximal number of disjoint oriented copies of w which can 
be obtained as subpaths of a. All other definitions are as above. These modified 
counting functions will be important in the sequel when we discuss the action of 
the mapping class group MCG(5) on the complex of curves 6(5), for some surface 
S. 

Proposition 4.2 (Lemma 3.3 |16j). If a is a realizing path for c^^w, then it is a 
{K,e)-quasigeodesic, where 

_ \M _ '2.W\w\ 

~ \w\ - W ' ^ ~ \w\ - W 
It is known that in a (5-hyperbolic space, any two [K^ e)-quasigeodesics which 
have same end points stay in the L{K, e, (5)-neighborhood of each other for some 
universal constant L (see |21)). 

Let L = L{\w\/{\w\ — W),2W\w\/{\w\ — VF), 5). Let a be a geodesic from x to 
y. From Proposition 14.21 we deduce that a realizing path for a must be contained 
in the L- neighbor hood of a. Consequently, if the L- neighbor hood of a does not 
contain a copy of w, then c^^wicy) = 0. 

Remark 4.3. We will use this fact later in our argument to avoid "reverse counting". 
Roughly speaking, let w be a word such that lu" is a geodesic. Then, for n > 0, 

Cw,W (w") > Wn 

because \w'^\w = n. 

Suppose the L-neighborhood of does not contain a copy of w^^ . Here we are 
thinking of the L-neighborhood of w", for large n, like a long narrow tube whose 
core has a definite orientation, agreeing with the orientation on w. By "a copy of 
■u;"^" here we mean a copy of w whose orientation disagrees with that of the core of 
the tube (compare with Fig. [T] and the accompanying discussion) . Then it follows 
that c„-i^n/(it;") = because for a realizing path a for c^-i^w &t iw" we must have 
|«U-i = 0. We thus obtain for all n > an inequality hw,w{w'^) > nW . 

Let D{h) be the defect of h. Then we have the following inequality: 

Proposition 4.4 (Prop 3.10 [16]). 

D{h^w) < 12L + 6W + m. 

Remark 4.5. Note that the defect only depends on W, S. If we take W = 1, then 
L depends only on 6 if \w\ > 2. In particular, the upper bound in Proposition 14.41 
depends only on S. We will take IF = 1 in all the applications in this paper. 
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5. The Gap Theorem, improved version 

We are now in a position to state and prove the improved version of the Gap 
Theorem, using counting quasimorphisms. The following theorem improves The- 
orem A in at least two ways: it does not use Bavard's theorem or Mineyev's flow 
space, and the constants in questions can be effectively estimated. 

Theorem A' (Gap Theorem, strong version). Let G be a word-hyperbolic group 
that is 6 -hyperbolic with respect to a symmetric generating set S with \S\ generators. 
Let a € G be a (non-torsion) element. Assume there is no n > and nob £ G with 
ba~^b~^ ~ a". Then there is a homogeneous quasimorphism h on G such that 

(1) h{a) = 1 

(2) The defect of h is < C{S, \S\). 

Moreover, let Oi € G be a collection of elements for which T — sup^ T(aj) is finite. 
Suppose that for all integers n,m ^ and all elements b € G and indices i, there 
is an inequality 

ba'^b-^ ^ a'T 

Then there is a homogeneous quasimorphism h on G such that 

(1) h{a) — 1, and h{ai) = for all i 

(2) The defect of h is < C'{S, |S'|,T) 

Remark 5.1. Note that in the second part of the theorem, we must state a gener- 
alization of the no mirror condition, involving both a and the elements a^. 

Remark 5.2. Hamenstadt ([23) used dynamical methods to directly construct 
quasimorphisms on word-hyperbolic groups. Similar ideas were developed by Pi- 
caud ([35]) in the special case of surface groups. It is not clear to us how to use 
these methods to obtain uniform estimates on defects. 

Proof. We follow [T3|, [T^. We assume in what follows that a is not torsion. Let N 
be a constant such that stabilizes a bi-infinite geodesic la. Note that one may 
find an N whose size can be bounded in terms of 5 and \S\. We also choose TV so 
that iVr(a) 3> L{S). Note that since T(a) can be bounded from below by a positive 
constant depending only on S and \S\, the constant N still depends only on S and 
15*1 (see the proof of Theorem A). 

After replacing a by a conjugate if necessary, we may assume that 1 ^ la. Set 
b — a^. la is an axis for b. We denote the subpath of la from 1 to 6 by la\b and the 
subpath from 1 to b~^ by la\b~^. Note that we think of these as oriented segments. 

Now let Cb be the counting function on G for the pair {la\b, 1) (note that we are 
setting W = 1 in the notation of S]). Then for any n > 0, we obtain an estimate 
Cb(fo") > n, since the subpath of la from 1 to 6" can be tiled by n disjoint translates 
of la\b. At the cost of possibly replacing TV by 2 A'' if necessary, we may assume that 
l^l^l > 2, so that L (as in depends only on S. 

Claim 1: There is a constant Ci{6, \S\) such that if A^ > Ci then there is no 
translate of la\b~^ in the L-neighborhood of la whose orientation agrees with that 

of la- 

Note that la\b~^ is a copy of la\b with the opposite orientation, so we could just as 
well state Claim 1 as saying that there is no translate oi la\b in the L-neighborhood 
of la whose orientation disagrees with that of la. We will prove the claim later; 
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we call the conelusion of the claim "no reverse counting" , and prove our theorem 
under this hypothesis. 

Take N to satisfy iV > Ci as well in the following. Note that N can still be 
chosen satisfying these criteria with size bounded from above in terms of 6 and IS*]. 

Let Cft-i be the counting function for the pair {la\b~^, 1). It follows from Claim 
1 that 

Cb-i(6") = 

for all n > 0. We define hb — Cb — cj-i , then obtain 

/i6(6") > n 

for any n > 0. 

By Proposition 14.41 and Remark 14.51 there is a constant K{S), which depends 
only on S such that 

D{hb) < K{5). 

Since b = a^, we get hb{a") > n/N for all n > 0. By averaging, we may replace 
hb by a homogeneous quasimorphism h' = hb (see ij2.3p . Then D{h') < AK{S) and 
h'{a) > 1/N. Define h — mh' for some constant m for which h(a) = 1. Then h is 
a homogeneous quasimorphism, and satisfies D{h) < 4NK{S). 

This proves the first part of the theorem, modulo Claim 1. 

The second part follows by almost the same argument. First of all, we may 
assume that Ui has infinite order (equivalently, r(ai) > 0) since every homogeneous 
quasimorphism will already vanish on torsion. Since we are looking for a homo- 
geneous quasimorphism which vanishes on a^, without loss of generality, we may 
replace each by a conjugate of a non-trivial power (< N{S, \S\)). Therefore, we 
may assume that Oi has a geodesic axis la^ , with 1 G • 

Let V be an oriented geodesic path in TsiG) from 1 to , and let he v with 
the opposite orientation. In other words, v = la\a" ■ 

A relative version of Claim 1 proves the second part of the theorem. We give 
two different proofs, which give different constants. Recall our notation b = . 

Claim 2: Suppose that N satisfies 

~ ' ' r(a,) + l 

for suitable Ci depending only on S and \S\. Then there is no copy oi la\b or la\b~^ 
in the L-neighborhood of la^ ■ 

Claim 2': Suppose that N satisfies 

N > CilS-f 

for suitable Ci depending only on S and \S\. Then there is no copy oi la\b or la\b~^ 
in the L-neighborhood of ■ 

It follows from Claim 2 or 2' that h as constructed above satisfies h{ai) = for 
all i. 

Proof of Claim 1. In fact, the proof of the claim follows by the same argument as 
the end of the proof of Theorem A. A copy of la\b~^ in the L-neighborhood of la is 
contained in an axis of some element e which is conjugate to a~^; i.e. e = ca~^c~^ 
for some c. 
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Let V denote such a copy of la\b^^, and let vq represent its initial point. The 
geodesic /„ is invariant under but not necessarily under a itself. Nevertheless, 
there is a constant C2 depending only on d such that a'^{vo) is within distance C2 
of la, for any integer i. Then for alH < A'' the element e~*a* satisfies 

d{vo,e-'a'{vo)) < 2L + 2C2 

So if iV > |5|2i'+2C2 jnust have e~'ia'i = e'^^a'^ for two distinct indices 11,12, 
and therefore the {12 — ii)-th power of a is conjugate by c to its inverse. Set 

Proof of Claim 2. We only sketch the proof of Claim 2 since the details are almost 
identical to those of the proof of Claim 1. If the L-neighborhood of la^ contains a 
copy of a arbitrarily long subpath of la, we can find a point vq for which 

d{vo, e'^'aiivo)) <2L + 2C2 + r(a) 

for some conjugate e of a or a~^, and for any < j < J for an arbitrarily big 
(fixed) J. It follows that if > j^pi+sCs+rCa) ^^^^ some non-trivial power of 
is conjugate to some (possibly trivial) power of a, say a'^. Moreover, if 

^ > \gl'2L+2C2+r{a) 

T{ai) + 1 

then fc 7^ 0, contrary to hypothesis. 

Proof of Claim 2! . This argument interchanges the roles of a and in the proof 
of Claim 2. Suppose the i-neighborhood of lai contains a copy, v, of or la^~^. 

The segment i; is a part of an axis of e = cac~^ or ca~^c~^. Let Vq be the starting 
point of V. Then for each < j < N , there exists kj such that 

d{vo,a';'e^{vo)) < 2L + 2C2 + r(a,) < 2L + 2C2 + T 

(we use the same constant C2 for a^). Therefore, if iV > |5'|2i+2C2+T^ then some 
non-trivial power of a is conjugate to some (possibly trivial) power of a,, which is 
impossible. 

This completes the proof of the theorem. 

□ 

Remark 5.3. The second statement can be used to give a lower bound of the stable 
commutator length "relative to {a,}". That is, if we can write a as a product 

a=[6i,ci]---[6„,c„]a™^---a™'= 

then we say the relative length of a is < n + k/2. The infimum of this number is 
the relative commutator length, and the liminf of the relative commutator length 
of a" divided by n as n — > 00 is the relative stable commutator length. Using our 

theorem, the relative commutator length has a lower bound of l/2D{h), where h is 
a homogeneous quasimorphism obtained in the second part for a and the a^. 

Remark 5.4. One may reinterpret the second part of Theorem A' as follows. In any 
group G, let Bi (G) denote the real vector space of group 1-boundaries (i.e. group 
1-cycles that are boundaries of group 2-chains), and for any chain "Y^tiQi G B\{G) 
define scl by the formula 



scl(^i,5i) = sup^^ 



18 



DANNY CALEGARI AND KOJI FUJIWARA 



(compare with Theorem 12. 5|) . This fmiction is a pseudo-norm, and agrees with 
scl on ordinary elements (see [6], § 2.6). It evidently vanishes on the subspace H 
spanned by cycles of the form g — hgh~^ and g" — ng, and descends to a quotient 
pseudo-norm on (G) := Bi{G)/H. Then the second part of Theorem A' implies 
that whenever G is hyperbolic, scl is a genuine norm on B^{G). 

We refer to the homogeneous quasimorphisms constructed in the proof of Theo- 
rem A' as counting quasimorphisms , by contrast with the abstract quasimorphisms 
promised by Bavard's theorem. 

It is interesting to speculate that one could use Theorem A or A' as a starting 
point to invert the word-hyperbolic Dchn surgery theory developed by Groves and 
Manning and independently by Osin (c.f. [23], [3T]). 

Corollary 5.5. Let K be a knot in of genus g. Then for any (5 > and every 
integer n > there is a constant C{S,n) > such that if Mp/g is the result of 
p/q surgery on K, and \p\ > C ■ g then every homomorphism from ni{Mp/y) to a 
torsion-free n-generator 6 -hyperbolic group is trivial. 

Proof. By construction, if a represents the core geodesic of Mp/g , we can estimate 

scl(a) < g/\p\ 

Moreover, since X is a knot in S^, TTi{Mp/g) is normally generated by a. 

Let G be 5-hyperbolic and torsion free. Then no element is conjugate to its 
inverse, and therefore there is a uniform lower bound on the stable commutator 
length of any nontrivial element in G, depending only on 5 and the size of a gener- 
ating set for G. li p : Tii{Mpfg) — > G is any homomorphism, then sc\{p{a)) < scl(a), 
and the image is normally generated by p{a). The claim follows. □ 

Remark 5.6. One knows that under a degree 1 map between hyperbolic manifolds 
of the same dimension, volume must go down. It is therefore significant in this 
corollary that the volumes of the manifolds Mp/g go up as \p\ — > oo. 

6. First accumulation point 

In a torsion free word-hyperbolic group G, stable commutator length defines a 
function scl from conjugacy classes to R. By Theorem A or A', the first accumula- 
tion point Sao for the image of this function is positive. In this section, we obtain 
universal estimates for S^o which are independent of G. 

Theorem B (Accumulation Theorem). Let G be a torsion free non- elementary 
word-hyperbolic group. Then the first accumulation point S^o for stable commutator 
length satisfies 

1 , 1 
12 - °° - 2 

Proof. To obtain the upper bound, observe that G contains a quasi-isometrically 
embedded copy of the free group on 2 generators. A nonabelian free group contains 
infinitely many conjugacy classes with scl < 1/2 (see e.g. [I]). Since the embedding 
is quasi-isometric, the image of infinitely many of these conjugacy classes stay non- 
conjugate in G. Under any homomorphism, scl cannot go up, so the upper bound 
is proved. 

An elementary argument gives a lower bound of 1/24. We follow the argument in 
the proof of Theorem A, and we adopt notation and the setup from that theorem. 



STABLE COMMUTATOR LENGTH IN WORD-HYPERBOLIC GROUPS 



19 



In any hyperbolic group, there are only finitely many conjugacy classes of elements 
whose translation length is bounded above by any constant. Therefore, we may 
assume the translation length of an element a is as long as we like. In particular, 
we can assume that there is an axis la which is geodesic and invariant under a. As 
before, let a denote a fundamental domain for a". Then length(a) = nr(a). 

We suppose, as in the proof of Theorem A, that there is a segment a d la 
satisfying 

> length(a)- (4m- 2)(C^ + Cl) _ 
12m — 6 

which cobounds a strip i? of Si of width < C5 with a translate c{la)- As before, 
there is e = ca~^c~^ which stabilizes c{la) and moves points in almost the same 
direction as a. 

If the translation length of a is long enough, then under the assumption scl(a) < 
1/24, the estimate above gives us that length(cr) is more than twice as big as a 
fundamental domain for a. For, if scl(a) < 1/24 then we can choose n, m as above 
such that n/m > 24. Now, lcngth(Q;) = nT{a), so 

71/ 

longth((T) > — ^T(a) - 0(1) > 2T(a) 

providing T(a) is sufficiently big. 
In fact, we can assume that 

length((7) 

2r(a) + ' 

where e is any number smaller than 1/24 — scl(a). In particular, we can assume 

lcngth((T) - 2r(a) = 2C(i 
where Cg is as big as we like. We parameterize a as a(i) where 

. [ -|length(g)| |length((7)| ' 
[ 2 ' 2 
Observe that for all i G [— Ce, Cq] we have estimates 

d{e-^a{a{i)),aii)) < 4C5, d{ae-\a{i)),a{i)) < 4C5 
For all — Ce < i < Cq we let /Xj € G be such that 

Ai.(a(0)) = a{{) 
It follows that for any \i\ < Cg we have 

d{ii-^e-^aiiiiaiO)),aiO)) < AC5, dC^-^ae- V»(«t(0)), a(0)) < 4C5 

If Cq is very big compared to (jS'l'^'-^'^)^ then by the pigeonhole principle there 
are ii, . . . ,i„ where n > Ce/dS'l^'-^^)^ for which MijM^^ commutes with both e~^a 
and ae~^ whenever 1 < j, k <n. Up to this point, our argument makes no use of 
the fact that G is torsion free. 

In a torsion free word-hyperbolic group, two nontrivial elements which commute 
are proportional. Since fii^fi~^^ commutes with both e~^a and ae~^ , it follows that 
e~^a and ae^^ themselves are proportional. Since they are conjugate, they have the 
same translation length, and are either equal or inverse, since G is torsion free. In 
the first case, a and commute; since they have the same (positive) translation 
length, they are either equal or inverse. Since their axes are almost oppositely 
aligned along a, they must be inverse, so a = e. But e = ca~ 
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in a torsion free group. In the second case, e^^a — ea^^ so — a^, and therefore 
e — a since G is torsion free, and we get a contradiction in any case. This proves 
the estimate Soo > 1/24. 

To get the estimate Soo > 1/12 we use Theorem 12.71 We argue as above that if 
scl(a) < 1/12 and a has sufficiently long translation length, then 

|length(cr) -T(a) I = 26*6 

is as big as we like. Let p be a point at a distance Cq from one of the endpoints of 
a. 

Let p, q be the ideal points stabilized by a, and r, s the ideal points stabilized by 
b. Then for suitable parameterizations of the flowlines (p, g, •), (r, s, •), the points 
{p,q,t) and {r,s,t) are within distance Me^'^^ for t £ [Cg, length(cr) — Cg], where 
M and A < 1 are universal constants (depending only on S and \S\) but Cg is as 
big as we like. 

This requires some explanation: by Theorem 12.71 bullet (5), for any three ideal 
points X, y, z there are parameterizations of (x, z, •) and (y, z, ■) for which 

d''{{x,z,t),{y,z,t))<MX' 

for suitable M and < A < 1. In fact, by Remark l2.91 these parameterizations are 
exactly those for which (x, z, 0) is the closest point on (x, z, •) to y, and (y, z, 0) is 
the closest point on {y,z,-) to x. Now, if {p,q^-) and (r, s,-) are flowlines which 
have long sub-segments which are distance < C5 apart, then if m is the point on 
(p, q, •) which is closest to r and n is the point on (p, q, •) which is closest to s, then 
TO,n are uniformly close to the endpoints of a. Similarly, if m',n' are the points 
on (r, s,-) closest to p,q respectively, then m' ,m are close, and so are n,n'. Now 
consider the flowline (p, s, •). This flowline converges exponentially fast to (p, •) 
along the ray from n to p. Similarly, it converges exponentially fast to (r, s, •) along 
the ray from m' to s. Hence away from a bounded neighborhood of the endpoints 
of (T, the flowlines (p, (7, •) and (r, s, •) are themselves exponentially close. This is 
the kind of convexity argument which is very standard in strictly negatively curved 
spaces; Mineyev's technology allows us to transplant it to "J. 
It follows that 

(e-ifl ■ (p, 9, Cg), (p, g, Cg)) < 2Me^^« 
By the triangle inequality, for any n we estimate 

d"" {{e-^aT ■ (p, g, Cg), (p, 9, Cg)) < 2nMe^<^^ 

and therefore 

, , 2nKMe^'^'' + e 

rfe a) < 

n 

where K, e are as in the last bullet of Theorem 12.71 The constants K, M, A, e all 
depend only on S and \S\, whereas we may choose Cg as big as we like, and n as 
big as we like. In particular, T{e~^a) may be made arbitrarily small by choosing 
Cg very big. 

On the other hand, if a is not equal to e, then T{e~-^a) > Ci > for Ci 
depending only on d and \S\. This gives a contradiction, for sufficiently big Cg 
(chosen depending on Ci). 

This contradiction proves the theorem. □ 
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Example 6.1. The upper bound 1/2 is sharp, and is realized in a nonabelian free 
group, or closed hyperbolic surface group, by [7]. 

Example 6.2. Suppose H = {hi, 112) is a non-free 2-generator subgroup of (any 
group) G. Then one can show scl([/ii, /i2]) < 1/2. However, Delzant showed ([9]) 
that in any word-hyperbolic group there are only finitely many conjugacy classes 
of non-free 2-generator subgroups. 

If G is allowed to have torsion, things become slightly more complicated. 

Example 6.3. Let 5 be a (surface) orbifold containing two orbifold points ^12,^3 of 
order 2 and 3 respectively. If a is any embedded arc in S from p2 to ps, then the 
boundary of a regular neighborhood of a has scl at most 1/12. For typical S there 
are infinitely many distinct isotopy classes of such arcs a. 

It is straightforward to see that in any word-hyperbolic group G, there is a 
positive first accumulation point for scl on conjugacy classes satisfying the no mirror 
condition (in fact this follows directly from Theorem A and Theorem A'), but we 
have not been able to show that this lower bound is independent of G, and therefore 
we pose the following 

Question 6.4. Is there a universal positive constant C such that the first accu- 
mulation point for scl on conjugacy classes satisfying the no mirror condition in a 
hyperbolic group is at least C ? 

Finally, the explicitness and universality of the constants in Theorem B, together 
with Example 16.11 motivates the following 

Question 6.5. Can the lower bound in Theorem B be improved to 1/4? 

7. Mapping class groups 

Our theorems may be generalized to groups which are not themselves hyperbolic, 
but which act suitably on (5-hyperbolic spaces. In this section and the next, we 
concentrate on two important examples. 

We show how to adapt our Gap Theorem to the action of the mapping class group 
on the complex of curves. For an introduction to this complex and its properties, 
see [55] or [3^ . The proof follows much the same outline as the proof of Theorem 
A'. A significant difference is that the action of MCG(S') on 6(5") is not proper, 
nevertheless it is weakly proper in a suitable sense , and this weak properness is 
enough. The technical tool we use is the acylindricity of the action of MCG(S') on 
C{S) as observed by Bowditch [3 . 

We denote the translation length of an element a e MCG(S') on 6(5*) by T(a). 

Theorem C (Mapping Class Theorem). Let S be a compact orientable surface 
of hyperbolic type and MCG(S') its mapping class group. Then there is a positive 
integer P depending on S such that for any pseudo-Anosov element a, either there is 
an < n < P and an element b G MCG(S') with ba^"b^^ = a", or else there exists 
a homogeneous quasimorphism h on MCG(S') such that h[a) = 1 and D{h) < G{S), 
where C{S) depends only on S. 

Moreover, let Oi G MCG(S') be a collection of elements for which T — supj r(ai) 
is finite. Suppose that for all integers n,m ^ and all elements b G MCG(S') and 
indices i, there is an inequality 

ba'^b-^ ^ aT 
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Then there is a homogeneous quasimorphism h on MCG(S') such that 

(1) h(a) — 1, and h{ai) — for all i 

(2) The defect of h is < C'{S,T) 

Remark 7.1. By Thurston's classification of surface automorphisms (see e.g. [35]) 
every element of infinite order in MCG(S') is either reducible or pseudo-Anosov. 
An element a £ MCG(S') has T(a) = on C{S) if and only if it has finite order or 
it is reducible. 

Remark 7.2. Note the reappearance of the no mirror condition. Also note the 
separation theorem; compare with the statement of Theorem A'. 

Remark 7.3. The dependence of C'(S) on S is somewhat subtle and indirect, and 
does not seem to be easy to estimate. 

Proof. The basic structure of the proof should be reasonably clear at this point. 

Let C(S') be the curve graph of S. C{S) is (5-hyperbohc, Any pseudo-Anosov 
element acts as an axial isometry. Moreover, by Bowditch [3], there exists B{S) 
such that for any pseudo-Anosov element a, has a geodesic axis in 6(5'). So, in 
particular, T(a) > 1/B. 

If there exists n > and b e MCG(S') with ba^^b^^ — a^", then one may assume 
n < P{S), where P{S) depends only on S. This follows because the action of 
MCG(S') on C{S) is acylindrical in the sense of Bowditch, 3J. Here is the precise 
statement of acylindricity: for any A > 0, there exists E, M such that for any two 
points x,y € C{S) with d{x, y) > E then there are at most M elements b € MCG(S') 
such that d{x, bx) < A, d{y, by) < A. We consider the case A = lOS. So, in the 
following we assume that for all < rt and b G MCG(S'), we have 6a"6^^ ^ a^". 

From above, one finds that there is P' > such that if n > P', then (a copy 
of the geodesic segment) a~" does not appear in the L-neighborhood of a geodesic 
axis of a^, where L is the constant from the section 2] which depends only on S in 
this setting. This is because, otherwise, one finds that a™ is conjugate to a~'" for 
some < m, which is a contradiction. 

So, there exists R{S) such that for any a as in the theorem, there exists N < R 
such that has a geodesic axis, and no reverse counting happens for . 

Let a be a geodesic axis for . Let x € a, and denote the subpath from x to 
a'^x as a^ . Let h = c — c^ be the quasimorphism constructed using the counting 
functions for the pairs (a^, 1) and (a^^, 1). Then, for any n > 0, c(a^") > n and 
c^{a^^^) = since there is no reverse counting. So, /i(a^") > n for all n > 0. We 
know D{h) < K{S) where K{S) depends only on S by Proposition 14.41 Therefore, 
given a, we can construct (by averaging h) a homogeneous quasimorphism / such 
that f(a) = 1 and D{f) < AKR, where the constant KR depends only on S. 

The argument to prove the second part is very similar to the proof of the second 
part of Theorem A'. Given a collection of elements and a uniform upper bound 
T on their translation lengths, if there is a translate of the axis of a which stays 
close to an axis of on a sufhciently long interval, then by acylindricity and the 
pigeonhole principle, we can conclude that some power of a is conjugate to a power 

of Gi. 

In slightly more detail: let la^ be an axis for a^, and let v be an axis for a 
conjugate e — cac^^ of a which stays close to lai on a sufficiently long segment f3 
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(we say how long in a moment). Let integers nj,mj be chosen for which 

\njT{ai) — mjT{a)\ < Ci 

for some fixed constant Ci. Given an upper bound on T and r(a), we can find 

at least C2 many such pairs {rij.mj) whose absolute values arc bounded by C3 
where C2 is as big as we like, and where C3 depends on T, T(a), S, C\ (and not on 
the specific element a,). Let /? be longer than C3T, (73T(a). Then (after possibly 
replacing rij by —rij for some j) we observe that 

d{a':'e^^{p),p)<C[ 

for some which depends only on Ci and S. By acylindricity and the pigeonhole 
principle, there are distinct indices (which we denote by ni, mi, n2, respectively) 
for which 

and therefore some nonzero power of is equal to some nonzero power of e, which 
is itself conjugate to a, contrary to hypothesis. 

This shows that the length /? can be bounded above in terms of 5, T, T(a). 

Note that one constructs by this argument a single quasimorphism h whose value 
grows linearly on powers of a, and which vanishes on all powers of simultane- 
ously. The fact that there are (typically) infinitely many aj on which h vanishes is 
immaterial. 

We give another proof of the second claim which makes more explicit use of 
acylindricity and gives the slightly better constants claimed in the statement of the 
theorem. 

The argument is similar to the one to show for sufficiently large N , there is no 
reverse counting for a^, since otherwise, for some < n < TV, a" is conjugate to 
o~" by the acylindricity of the action, which is impossible. Also, see the proof of 
Claim 2' in the proof of Theorem A'. 

We want to show C(j±2jv (oj) = for all i if is bigger than a constant depending 
only on S and T. Suppose Cq2jv (a^) > for some (fixed) i (the argument is precisely 
analogous for Ca-2jv). Let la^ be an axis for a,, and let Ig be an axis for a conjugate 
e = cac~^ of a which stays close to, namely in the L((5)-neighborhood of, la^ on a 
segment v such that one endpoint is and the other end point is e^^(wo). 

For simplicity, we assume both lai^le are geodesies (replace a, by a^,af if 
necessary). Then v is 2(5-close to la^, where S is the hypcrbolicity constant for G{S) 
(here we assume v is much longer than S, which follows if we take N bigger than a 
constant depending only on S). 

We observe that for all < j < N, there exists rij such that 

d{a^'e^{vo),vo), d(o^^e^+^(t;o), e^(t;o)) < T{ai)+4S <T + 4S 

We set A = T + AS and obtain corresponding constants D{A),M{A) for the 
acylindricity of the action. Now, assume N is such that d{e^{vo),vo) > D{A). In 
other words, A/'r(a) > D{A) (we know T(a) > 1/B, so take N > BD). Then, by the 
acylindricity, there are at most M distinct elements in a"^ eJ (0 < j < N) . It follows 
that if A > M, then some non-trivial power of a is conjugated by c to a (possibly 
trivial) power of Oj, which is impossible. We thereby obtain an upper bound for N 
by A, D{A), M{A), B, which depend only on S, T, to have c„2n (a,) > 0. We obtain 
the same upper bound from Ca-2N(aj) > as well. 

□ 
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8. AMALGAMATIONS 

In this section we adapt our theorem to the special case of an amalgamated free 
product acting on its associated tree, and construct many quasimorphisms with 
uniform lower bounds. One must be slightly careful: the group SL(2, Qp) is an 
amalgam of two copies of SL(2, Zp). Nevertheless, SL(2,Qp) is uniformly perfect, 
and therefore admits no nonzero homogeneous quasimorphisms at all. As in the 
case of the mapping class group acting on 6(5*), one must ensure (by fiat) that the 
action of the amalgam on its associated tree is weakly proper, this is guaranteed by 
a suitable malnormality condition. 

Theorem D (Amalgamation Theorem). Let G — A *c B. Let w be a word on 
A, B which is reduced and cyclically reduced such that \w\ > 1, and let w denote the 
corresponding element in G. Suppose that the double coset GwC, does not contain 
the element corresponding to any cyclic conjugate of . Then there exists a 
homogeneous quasimorphism h on G such that h(w) ~ 1, and the defect of h is 
< 312. 

Proof. We use [T7|. Here is a review. Let F be the Cayley graph of G for the 
generating set AU B. 

Then a geodesic between 1 and a is exactly a word for a reduced as a word 
in A *c B (Lemma 3.1 17[). Let w be a reduced word for a. Assume that w is 
cyclically reduced. Then w* = ■ ■ ■ www ■ ■ ■ is an infinite reduced word, which is a 
geodesic in F since w is cyclically reduced. For such let c, c_i be the counting 
function for {w, 1), {w~^ , 1). Then, 0(0") > n for all n > 0. 

By the double coset condition, c_i(a") = for all n > 0, namely, no reverse 
counting. The argument is essentially in [T7]. Suppose c_i(a") > 0. Then, there 
is a realizing path a from 1 to a" which contains w~^ as a subword. It is shown 
in Lemma 3.2 [T7] that one can always take a realizing path to be a geodesic. Let 
P = w", which is a geodesic from 1 to a". Since both reduced words a and /3 
represent the same element a" , by Britton's lemma, there must be c,c' ^ G such 
that owe' — V, where v is some cyclic conjugate of w~^ . (see section 4 [17] for 
details). But this is prohibited. 

It follows that /i(a") > n, where /i = c — c_i. It is shown in Proposition 3.1 [T7] 
that D{h) < 78. When we make a quasimorphism homogeneous, an upper bound 
for the defect becomes 312 = 78 x 4. □ 

Remark 8.1. There is no implied suggestion that the constant 312 is optimal! The 
main point is that it is a universal constant, which does not depend in any way on 
the group G. 

Remark 8.2. If C is trivial, so that G is a free product, Bavard ([T], Prop. 3.7.2) 
obtains the (optimal) lower bound scl > 1/2. 

Remark 8.3. Note that the double coset condition is the precise analogue of the no 
mirror condition in the context of amalgamated free products. 

Question 8.4. What is the optimal constant in Theorem D? 

The following example clarifies the geometry of the double coset condition in the 
context of 3-manifold topology. 

Example 8.5. Let M be a knot complement in a rational homology sphere. After 
choosing an orientation for the knot and for M, there is a natural choice of meridian 
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and longitude rn, I on dM. The longitude is defined by the property that it is 
virtually trivial in homology. So there is some surface S of least Euler characteristic 
in A'l whose boundary is a multiple of the longitude. 

Suppose [dS] = n ■ [I] in homology. Define XQ — "^rr^- Let Mp/g be the result 
of Dehn filling M along the slope p/q. Let 7 C Mp/g denote the core of the added 
solid torus. As in Example 12.31 we have an estimate for stable commutator length 



scl(7) < 




As before, for K a knot in S^, this estimate becomes 

scl(7) < 

P 

where g is the genus of a Seifert surface for K. 

A slope p/q on dM is said to be a boundary slope if there is an essential, oriented, 
proper surface Y. C M such that is a nonempty union of curves isotopic to the 
p/q curve, with either orientation. 

After filling M to Mp/g, the manifold Mp/g splits along a surface S' obtained 
by filling in the boundary components of S, into two submanifolds which by abuse 
of notation we denote Ma, Mb with fundamental groups A,B. For brevity, we 
denote 7ri(E) = C. The core geodesic 7 intersects the two submanifolds Ma, Mb 
efficiently, in a collection of proper arcs which represent elements of the double 
coset spaces CAC and CBC . If the volume of M (and therefore that of Mp/g) is 
small, and the area of E' is large, most of Ma, Mb must be very thin, and have 
the structure of an /-bundle over some subsurfaces E^, E^ of E' = OMa = OMb- 
These /-bundles are known as the characteristic submanifolds of Ma , Mb and we 
denote them by Ia,Ib respectively. 

If E' is connected, the boundary components of Ia and Ib are contained in a 
single surface. Suppose that OIa and OIb are connected. The geodesic 7 is de- 
composed into a sequence of geodesic segments jj with < j < z which are the 
connected components of 7 n {Mp/g — E'). Each 7^ is properly embedded in Ma 
or Mb, and is contained in the corresponding characteristic submanifold I a or Ib- 
Since we are assuming dlA,dlB are connected, each oriented geodesic segment 7^ 
can be dragged around Ia (say) into itself in such a way that the orientation is 
reversed at the end. Composing these proper isotopies, we can drag all the 7j si- 
multaneously in such a way that the common endpoints of 7j and 7j-i agree during 
the dragging, for each j < i. The initial point of 70 and the final point of 7^ will not 
necessarily follow homotopic paths under this dragging, and their difference is an 
element of 7ri(E'). Translating this into algebraic information, we have exhibited a 
conjugate of 7 as an element of the double coset space 7ri(E')7~^7ri(E'). 

This example actually occurs: Nathan Dunfield [TT| has produced examples of 
Montesinos knots with genus 1 for which p/1 is an essential slope where p ~ 20, 000. 
The corresponding core geodesies 7 satisfy scl(7) < and therefore by Theorem 

D, 7 is conjugate into 7ri(E')7~"'^7ri(E'). 

The double coset condition shows us how to think about the geometry of the 
resulting manifolds, and the way in which 7 sits inside them. 
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